Text Skew Angle Detection in Vision-Based Scanning of Nutrition Labels

نویسنده

  • Tanwir Zaman
چکیده

An algorithm is presented for text skew angle detection in vision-based scanning of nutrition labels on grocery packages. The algorithm takes a nutrition label image and applies several iterations of the 2D Haar Wavelet Transform (2D HWT) to downsample the image and to compute the horizontal, vertical, and diagonal change matrices. The values of these matrices are binarized and combined into a result set of 2D change points. The convex hull algorithm is applied to this set to find a minimum area rectangle containing all text pixels. The text skew angle is computed as the rotation angle of the minimum area rectangle found by the convex hull algorithm. The algorithm’s performance is compared with the performance of the algorithms of Postl and Hull, two text skew angle algorithms frequently cited in the literature, on a sample of 607 nutrition label images whose text skew angles were manually computed by two human evaluators. The median text skew angle error of the proposed algorithm, Postl’s algorithm, and Hull’s algorithm are 4.62, 68.85, and 20.92, respectively. Keywords— computer vision; text skew angle detection; OCR; 2D Haar wavelet transform; wavelet analysis

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Pre-processing and Text Segmentation for OCR

Optical Character Recognition (OCR) systems have been effectively developed for the recognition of printed script. The accuracy of OCR system mainly depends on the text preprocessing and segmentation algorithm being used. When the document is scanned it can be placed in any arbitrary angle which would appear on the computer monitor at the same angle. This paper addresses the algorithm for corre...

متن کامل

Improved Skew Detection and Correction Approach Using Discrete Fourier Algorithm

The main objective of Image processing is to convert an image into digital form and perform some operations on it, in order to get an enhanced image or to extract some useful information from it. But when they are needed to be converted into electronic form, it has to be done through scanning. One of the major problems in this field is that if the document to be read is not placed at 90. This w...

متن کامل

Skew detection for complex document images using robust borderlines in both text and non-text regions

0167-8655/$ see front matter 2008 Elsevier B.V. A doi:10.1016/j.patrec.2008.06.008 * Corresponding author. Address: National Lab on University, Beijing 100871, China. Fax: +86 10 62755 E-mail address: [email protected] (H. Liu). A new skew detection method for complex document images based on robust borderlines extracted from both text and non-text regions is proposed in this paper. First, bor...

متن کامل

Improved Nearest Neighbor Based Approach to Accurate Document Skew Estimation

The nearest-neighbor based document skew detection methods do not require the presence of a predominant text area, and are not subject to skew angle limitation. However, the accuracy of these methods is not perfect in general. In this paper, we present an improved nearest-neighbor based approach to perform accurate document skew estimation. Size restriction is introduced to the detection of nea...

متن کامل

Application of Radon Transform in Detecting Turning Angle of Bodies and in Reading Multi - Lingual Documents

Recently, image processing technique and robotic vision are widely applied in fault detection of industrial products as well as document reading. In order to compare the captured images from the target, it is necessary to prepare a perfect image, then matching should be applied. A preprocessing must therefore, be done to correct the samples’ and or camera’s movement which can occur during the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015